Speech Recognition Using a Discriminative, Context-Independent, Segment-Based Speech Recognizer

Authors

  • Jan Verhasselt
  • Jean-Pierre Martens
  • Bart Baeyens
Abstract

In this paper, we describe important improvements that were recently introduced in our Discriminative Stochastic Segment Model (DSSM) speech recognizer. We propose a new presegmentation algorithm and we optimize the structure of the Multi-Layer Perceptron (MLP) that estimates the phone probabilities. Additionally, we describe a cascade MLP combination technique that mitigates the drawbacks of traditional stochastic segment models. The proposed improvements have resulted in a statistically significant increase in speaker-independent continuous phone recognition performance on the TIMIT corpus.

Similar resources

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation, which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighboring phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition

Convolutional Neural Networks (CNNs) have shown their performance in speech recognition systems for feature extraction and acoustic modeling. In addition, CNNs have been used for robust speech recognition, and competitive results have been reported. A Convolutive Bottleneck Network (CBN) is a kind of CNN that has a bottleneck layer among its fully connected layers. The bottleneck fea...

Context-Dependent Modeling in a Segment-Based Speech Recognition System

The goal of this thesis is to explore various strategies for incorporating contextual information into a segment-based speech recognition system, while maintaining computational costs at a level acceptable for implementation in a real-time system. The latter is achieved by using context-independent models in the search, while context-dependent models are reserved for re-scoring the hypotheses pr...

Publication date: 2007